Content-Based Social Network Analysis of Mailing Lists
نویسندگان
چکیده
Abstract Social Network Analysis (SNA) provides tools to examine relationships between people. Text Mining (TM) allows capturing the text they produce in Web 2.0 applications, for example, however it neglects their social structure. This paper applies an approach to combine the two methods named “content-based SNA”. Using the R mailing lists, R-help and R-devel, we show how this combination can be used to describe people’s interests and to find out if authors who have similar interests actually communicate. We find that the expected positive relationship between sharing interests and communicating gets stronger as the centrality scores of authors in the communication networks increase.
منابع مشابه
Testing Generative Models of Online Collaboration with BigBang
We introduce BigBang, a new Python toolkit for analyzing online collaborative communities such as those that build open source software. Mailing lists serve as critical communications infrastructure for many communities, including several of the open source software development communities that build scientific Python packages. BigBang provides tools for analyzing mailing lists. As a demonstrat...
متن کاملPALADIN: A Pattern Based Approach to Knowledge Discovery in Digital Social Networks
Digital media are used to facilitate social structures thus building digital social networks. Disturbances in such networks occur on different levels (egocentric level, subgroup level, network) and have to be analyzed in the multidimensional context of reference disciplines like sociology and knowledge management. This paper presents a first repository of disturbance patterns for the analysis o...
متن کاملPredicting Email Response using Mined Data
Mailing lists are the primary medium of communication in open source projects. For some projects the sheer volume of emails on the mailing lists becomes unmanageable and messages may begin to be ignored. This can have a number of negative effects on an open source project. We present a way to predict who is most likely to respond to an email, thus providing the potential of giving mailing list ...
متن کاملSocial Network Structure Behind the Mailing Lists: ICT-IIIS at TREC 2006 Expert Finding Track
Expert finding system is a challenging problem in the enterprise environment. This paper introduce our research and experiments on TREC 2006’s expert searching track. In our experiments, we find some interesting features of the community structures in the mailing list network. We also use some link analysis approaches to rank the candidates in the social networks. In our experiments, we choose ...
متن کاملApplying an XML Warehouse to Social Network Analysis, Lessons from the WebStand Project
In this paper we present the state of advancement of the French ANR WebStand project. The objective of this project is to construct a customizable XML based warehouse platform to acquire, transform, analyze, store, query and export data from the web, in particular mailing lists, with the final intension of using this data to perform sociological studies focused on social groups of World Wide We...
متن کامل